add rag kuberay and jupyterhub image #440
Conversation
/gcbrun
small nits, otherwise LGTM
/gcbrun
/gcbrun
Going to try again; it failed while trying to port-forward rag-frontend, which this PR shouldn't really affect.
/gcbrun
Force-pushed from cd639b4 to 8d3ff81 (compare)
/gcbrun

/gcbrun
The first attempt succeeded and the second attempt failed at calling the webhook.
/gcbrun
* add rag kuberay and jupyterhub image (#440)
* Rollback to previous image (#454)
* Ray Webhook Support for Single-Host, Multi-Slice TPUs (#453)
* Fix incorrect replicaIndex for single-host, multi replica
* Fix single-host, multi-slice deletion logic
* Update README & simplify workloads.tfvars for RAG (#445)
* RAG marketplace updates (#456)
* fix RAG marketplace changes

---------

Co-authored-by: Chia-Yi Liang <chiayiliang327@gmail.com>
Co-authored-by: zlq <zlq@google.com>
Co-authored-by: ryanaoleary <113500783+ryanaoleary@users.noreply.github.com>
Co-authored-by: imreddy13 <132504814+imreddy13@users.noreply.github.com>
```diff
@@ -44,7 +44,8 @@ resource "helm_release" "ray-cluster" {
   security_context                  = local.security_context
   secret_name                       = var.db_secret_name
   cloudsql_instance_connection_name = local.cloudsql_instance_connection_name
-  image_tag                         = var.enable_gpu ? "2.9.3-py310-gpu" : "2.9.3-py310"
+  image                             = var.use_custom_image ? "us-central1-docker.pkg.dev/ai-on-gke/rag-on-gke/ray-image" : "rayproject/ray"
+  image_tag                         = var.enable_gpu ? "2.9.3-py310-gpu" : var.use_custom_image ? "2.9.3-py310-gpu" : "2.9.3-py310"
```
@chiayi can we simplify this by also publishing 2.9.3-py310 to Artifact Registry? Even without GPUs I think this would be beneficial.
Yes, I will work on creating an image for 2.9.3-py310 as well.
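For reference, once the non-GPU tag is also published, the selection logic could presumably collapse to a single ternary per argument. A rough sketch, assuming both tags exist in the Artifact Registry repository already referenced above:

```hcl
# Sketch only: assumes both 2.9.3-py310 and 2.9.3-py310-gpu are published to
# the Artifact Registry repository, so the tag no longer depends on
# use_custom_image.
image     = var.use_custom_image ? "us-central1-docker.pkg.dev/ai-on-gke/rag-on-gke/ray-image" : "rayproject/ray"
image_tag = var.enable_gpu ? "2.9.3-py310-gpu" : "2.9.3-py310"
```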
```diff
@@ -121,6 +121,8 @@ resource "helm_release" "jupyterhub" {
   gcs_bucket          = var.gcs_bucket
   k8s_service_account = var.workload_identity_service_account
   ephemeral_storage   = var.ephemeral_storage
+  notebook_image      = "jupyter/tensorflow-notebook"
```
Was this meant to be updated with the artifact registry image?
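If so, the value would presumably point at the same Artifact Registry repository; a hypothetical sketch (the image name below is illustrative and not confirmed in this PR):

```hcl
# Hypothetical: a pre-built notebook image from Artifact Registry instead of
# the upstream jupyter/tensorflow-notebook (the image name is an assumption).
notebook_image = "us-central1-docker.pkg.dev/ai-on-gke/rag-on-gke/jupyter-notebook-image"
```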
```diff
+variable "use_custom_image" {
+  type        = bool
+  description = "If running RAG, set this var to true to use custome image with pre-installed lib"
+}
```
This description shouldn't mention RAG; it's a separate variable unrelated to RAG, but we'll set it to true in the RAG deployment.
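A possible rewording along those lines (the `default` value is an added assumption):

```hcl
variable "use_custom_image" {
  type        = bool
  default     = false
  description = "Set to true to use a custom image with pre-installed libraries."
}
```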
```diff
@@ -228,6 +228,7 @@ module "kuberay-cluster" {
   grafana_host           = module.kuberay-monitoring.grafana_uri
   disable_network_policy = var.disable_ray_cluster_network_policy
   depends_on             = [module.kuberay-operator]
+  use_custom_image       = true
```
We shouldn't hardcode this to true; can we define a top-level variable and pass it down?
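One way that could look, as a sketch only (the default and the placement are assumptions):

```hcl
# Sketch: expose a top-level variable and pass it through to the module,
# rather than hardcoding true (default value is an assumption).
variable "use_custom_image" {
  type        = bool
  description = "Use the custom Ray image with pre-installed libraries."
  default     = true
}

module "kuberay-cluster" {
  # ...existing arguments...
  use_custom_image = var.use_custom_image
}
```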
Running the RAG example notebook will take only around 6-8 minutes in total.